Reagent Label Text Detection Using the Stroke Width Transfom
نویسندگان
چکیده
We implemented an algorithm to extract text from reagent labels to facilitate the retrieval of safety information in a laboratory using an Android mobile device. Our algorithm combined the stroke width transform and a variety of connected component filters to detect text candidates. These were then processed using Tesseract, an open-source optical character recognition engine. We concluded that Tesseract benefits from our implementation when reagent labels contain many non-text objects.
منابع مشابه
Directional Stroke Width Transform to Separate Text and Graphics in City Maps
One of the complex documents in the real world is city maps. In these kinds of maps, text labels overlap by graphics with having a variety of fonts and styles in different orientations. Usually, text and graphic colour is not predefined due to various map publishers. In most city maps, text and graphic lines form a single connected component. Moreover, the common regions of text and graphic lin...
متن کاملMOSLEH ET AL.: IMAGE TEXT DETECTION USING A NOVEL EDGE DETECTOR AND SWT1 Image Text Detection Using a Bandlet-Based Edge Detector and Stroke Width Transform
In this paper, we propose a text detection method based on a feature vector generated from connected components produced via the stroke width transform. Several properties, such as variant directionality of gradient of text edges, high contrast with background, and geometric properties of text components jointly with the properties found by the stroke width transform are considered in the forma...
متن کاملA TRAFFIC-AWARE MECHANISM TO ADJUST CONTENTION WINDOW IN 802.11E WIRELESS LANS
<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: -webkit-left; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; ba...
متن کاملINDUCING VALUABLE RULES FROM IMBALANCED DATA: THE CASE OF AN IRANIAN BANK EXPORT LOANS
<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: -webkit-left; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; ba...
متن کاملA TRAFFIC-AWARE MECHANISM TO ADJUST CONTENTION WINDOW IN 802.11E WIRELESS LANS
<span style="color: #000000; font-family: Tahoma, sans-serif; font-size: 13px; font-style: normal; font-variant: normal; font-weight: normal; letter-spacing: normal; line-height: normal; orphans: auto; text-align: -webkit-left; text-indent: 0px; text-transform: none; white-space: normal; widows: auto; word-spacing: 0px; -webkit-text-stroke-width: 0px; display: inline !important; float: none; ba...
متن کامل